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Abstract 

For a field F, let R{n,m) be the ring of invariant polynomials for the action of SL(n,F) x 
SL(n,F) on tuples of matrices - (A, C) £ SL(n,F) x SL(n, F) sends (B±,... ,B m ) £ M(n, F)® m 
to {AB\C ~ l ,..., AB TO C _1 ). In this paper we call R(n,m) the ring of matrix semi-invariants. 

Let /3(R(n,m )) be the smallest D s.t. matrix semi-invariants of degree < D generate R(n,m). 
Guided by the Procesi-Razmyslov-Formanek approach of proving a strong degree bound for 
generating matrix invariants, we exhibit several interesting structural results for the ring of 
matrix semi-invariants R(n, to) over fields of characteristic 0. Using these results, we prove that 
/ 3(R(n,m )) = Q(n 3 / 2 ), and ft(R(2,m)) < 4. 

1 Introduction 

Let F be a field. In this article, we study the ring of invariant polynomials for the action of 
SL(n,F) x SL(n, F) on tuples of matrices - (A,C) € SL(n,F) x SL(n,F) sends (B \,..., B m ) € 
M(n,F)® m to (ABiC ,..., AB m C~ l ). Denoted by R(n,m), we call this ring the ring of matrix 
semi-invariants , as (1) it is closely related to the classical ring of matrix invariants |Pro76| (see below 
for the definition, and |Dom00bllANS07| for the precise relationship between these two rings); and 
(2) it is the ring of semi-invariants of the representation of the m-Kronecker quiver with dimension 
vector (n,n). Here, the ?7i-Kronecker quiver is the quiver with two vertices s and t, with m arrows 
pointing from s to t. When m = 2, it is the classical Kronecker quiver. The reader is referred 
to [DW0011SVdB01[|DZ01| for a description of the semi-invariants for arbitrary quivers. 

Let /3(R(n,m )) be the smallest integer D s.t. matrix semi-invariants of degree < D generate 
R(n,m), and let a(R(n,m)) be the smallest integer D s.t. matrix semi-invariants of degree < D 
define the nullcone of R(n,m). 

Recently, the quantity cr(R(n, m)) has found several applications in computational complexity 
theory. In particular, if a(R(n,m)) is polynomial in n and m, then it follows that (1) division gates 
can be efficiently removed in non-commutative arithmetic circuits with division ( IIIW1 li ): and (2) 
there exists a deterministic polynomial-time algorithm that decides whether the non-commutative 
rank of a square matrix of linear forms is full or not over the rational number field ( |IQS15[IOurQ4| ). 
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We refer the interested reader to the cited works for further explanation on these connections. At 
present, the best bound for a(R(n,m)) is n\/\n/ 2]! over large enough fields |IQS15[ . 

One natural way to upper bound a(R(n,m)) is of course to upper bound j3(R(n,m)). Over 
algebraically closed fields of characteristic 0, by Derksen’s result |Der m, /3(R(n,m)) < max(2,3/8- 
n 4 • a(R(n,m)) 2 ). Therefore, if a(R(n,m)) is polynomial in n then /3(R(n,m)) is polynomial in 
n as wello Another compelling reason to examine /3(R(n,m )) is because of the following strong 
upper bound on (5 for the ring of matrix invariants over fields of characteristic 0. 

Consider the action of A € SL(n,F) on (B\, ..., B m ) € Af(n,F)® m by sending it to (ABiA~ 1 , 

..., AB m A~ l ). The invariant ring w.r.t. this action is denoted as S(n, m), and elements in S(n, m) 
are called matrix invariants. The structure of S(n, m) is well-understood over fields of characteristic 
0: the first fundamental theorem (FFT), the second fundamental theorem (SFT), and an n 2 upper 
bound for /3(S(n,m)) have been established in 1970’s by Procesi, Razmysolov, and Formanek 
Pro7(i . Raz74 ,-. lFor86) . Note that when applied to S(n,m) over characteristic 0, Derksen’s bound 
yields / 3(S(n,m )) < max(2,3/8 • n 2 • a 2 ) and a(S(n,m)) = n°^ n \ far from the n 2 bound as 
mentioned above. 

On the other hand, for R(n,m), as far as we are aware, the best bound for /3(R(n,m)) is 
0(n 4 • (n!) 2 ) over algebraically closed fields of characteristic 0, by combining the abovementioned 
results of jDerOlj and |IQS15| . 

Our goal in this paper is to prove a better bound on f3(R(n,m)) by following the approach of 
Procesi, Razmyslov, and Formanek. Therefore, in the following we restrict ourselves to fields of 
characteristic 0. While we do not achieve this, we describe several structural results for R(n,m), 
including the second fundamental theorem (Proposition [Tj) . Though some of these structural results 
should be known to experts, we could not find them in the literature, so we provide full proof 
details. Furthermore, these results allow us to prove that /3(R(n,m)) has a lower bound fi(n 3//2 ) 
(Proposition fTTl) . 

One technical result that we believe is new, is Proposition [T21 Roughly speaking, there exists a 
linear basis of matrix semi-invariants, such that each polynomial in this basis can be associated with 
a bipartite graph. An upper bound of D on /3(R(n,m )) would follow, if we can prove that when 
the degree is > D, modulo the linear relations (as described in the second fundamental theorem), 
every polynomial can be written as a linear combination of those basis elements whose associated 
graphs are disconnected (Proposition [9]). Proposition [12] then states that when D > n 2 , every 
matrix semi-invariant of degree D can be written as a linear combination of those basis elements 
whose associated graphs are disconnected or non-simple (e.g. with at least one multiple edges) 
modulo the linear relations. 

As an immediate consequence of Proposition [12] we prove that f3(R(2,m)) < 4 over fields of 
characteristic 0 in Theorem 13 While this bound is known from Domokos’ explicit generating set 
for /3(R(2,m)) [DomOOaj . we think this suggests the validity of the approach of Razmyslov, Procesi 
and Formanek when applied to matrix semi-invariants. Furthermore, since in this approach we do 
not exhibit invariants explicitly, we believe this approach will generalize to larger n. 

More previous works. Here we collect a few more previous works on matrix semi-invariants. To 
start with, since as mentioned, matrix semi-invariants are just semi-invariants of the representations 
of the m-Kronecker quiver, results on semi-invariants of quivers apply to matrix semi-invariants, e.g. 

1 Over fields of characteristic 0, by a theorem of Weyl |Wey97 |, /3(R(n , m)) < /3(R(n , n 2 )). Therefore if /3(R(n, m)) 
is polynomial in n and m then /3(R(n, m)) is just polynomial in n. See [DomO.f] for more on this. 
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the first fundamental theorem |DW00l[SVdB0HlDZ0l] . Let us give one description from [DZOlj : 
suppose R(n,m) C F[xR ; ] where i,j £ [n], k £ [m], and x) ■ are independent variables. Let 

X},. = (x\ k j). Then for A 1; ..., A m £ M (d. F), det(Ai <8> X\ + • • • + A m <g> X m ) is a matrix semi¬ 
invariant, and every matrix semi-invariant is a linear combination of such polynomials. Therefore, 
(. B\ ,..., B m ) is in the nullcone, if and only if for all d £ Z + and all (A\ ,..., A m ) £ M(d , F)® m , 
A\ (g> B\ + • • • + A m <g> B m is singular. 

A description of the nullcone of R(n,m ) can be found in {BD06ilANS07j . For certain small m 
or n, explicit generating sets of R(n,m) have been computed in e.g. jDom00bliDom00allDD12] . In 
these cases, elements of degree < n 2 generate the ringcl 

Several results for matrix invariants over fields of positive characteristics are known: FFT was 
established by Donkin in |Don92lfDon93| . an n 3 upper bound for a can be derived from j jCIW971 
Proposition 9], and Domokos in [ Dom02 , DKZ02 ] proved an upper bound 0{rvm n ) on /3. 

Organization. In Section [2] we briefly review the Procesi-Razmyslov-Formanek approach, and 
give an overview of our results. Then we describe the second fundamental theorem (Section [3]) , the 
FS dn bimodule structure (Section HD, and the S dn diagonal action (Section [5]). Finally, in Section El 
we use these structural results to prove a lower bound on f3(R(n,m)), and that /3(R(2,m)) < 4. 

2 An overview of the structural results 

An outline of the Procesi-Razmyslov-Formanek result. To motivate the results to be 
presented, we first review the n 2 bound for S(n, m) over characteristic-0 fields by Razmyslov [Raz74j 
and Procesi |Pro76j . and further elaborated by Formanek (For86| . Recall that S(n,m ) is the 
invariant ring of A € SL(n,Q) on (Bi,...,B m ) € M(n,Q)® m by sending it to (AB±A~ 1 , ..., 
AB m A~ 1 ). Our exposition follows the one by Formanek [For86] . and requires certain basic facts 
about the group algebra of the symmetric group as described therein. 

Let Xi, i £ [m] be an n x n matrix of indeterminants. Then Tr^Xjj • Xi 2 . Xj k ), k £ Z + , 

ij £ [m] generate all matrix invariants. The n 2 upper bound implies that invariants of this form 
with k < n 2 generate the ring of matrix invariants already. 

The proof for this upper bound starts with identifying multilinear matrix invariants in the 

form Ti^Xjj • X\ 2 . X , lk ), ij £ [m] pairwise distinct, with the permutations of a single cycle 

(*i, 12 -, ■ ■ ■ Xk)- Such identification gives a correspondence of multilinear invariants of degree d with 
the group algebra FS^. 

The second step is to use the second fundamental theorem of matrix invariants, which suggests 
that the linear relations are spanned by the two-sided ideals indexed by partitions of length > n. 
At this point, it is clear that 6 is a degree bound, if and only if, for any d > b, the reducible elements 
(those permutations whose cycle types are not of a single cycle of length d) together with the linear 
relations span the whole space. Then, by using the standard bilinear form for CS^ (setting the 
permutations as an orthonormal basis), this is equivalent to showing that the space J spanned 
by the irreducible elements (those permutations whose cycle types are a single cycle of length d), 
and the space I< n spanned by the two-sided ideals indexed by partitions of length < n, intersect 
trivially. 

2 We thank M. Domokos for pointing out this fact to us. 
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The third step is to observe that the linear map / defined by a —>• sgn(<r)<r where a € S^, sends 
the two-sided ideal corresponding to A, to the two-sided ideal corresponding to A, the conjugate of 
A. At the same time, / preserves every 1-dinrensional subspace in J, as sgn(cr) for any permutation 
a £ J is the same. Thus when d > re 2 , J and I< n have to intersect trivially: by contradiction, 
suppose a nonzero v E J PI I< n . Then f(v) = ±v, thus v € I< n fl /(/< n ). However, /(/<„) is 
spanned by the two-sided ideals of width < n. When d > n 2 , there does not exist a partition with 
both length and width < n. The n 2 degree bound then follows. 

Overview of the structural results of R(n,m). To carry on the above strategy for S(n,m), a 
first step is to give the multilinear invariants in R(n , m) a combinatorial description, which we take 
from [ANS07] . Since in this case we have multilinear invariants only when n divides m, we focus 
on R(n, dn ) in the following. Briefly speaking, we can identify a natural spanning set of multilinear 
invariants with n-regular bipartite graphs with d left (resp. right) vertices. 

Then it is necessary to obtain the second fundamental theorem (SFT) for R(n,dn). While 
certainly known to experts, we could not find an explicit statement in the literature, so we prove it 
in Proposition HI Then we need to describe the FS*i bimodule structure of R(n, dn) (Fact [4]). This 
is possible since R(n, dn) can be viewed as a tensor product of two column tabloid modules |Ful97L 
Chap. 7.4]. So b is a degree bound, if whenever dn > b, with the help of relations, those multilinear 
invariants indexed by connected bipartite graphs can be written as a sum of those ones indexed by 
disconnected bipartite graphs (Proposition [9]). Furthermore, a natural bilinear form on R(n,dn) 
can be defined by setting the standard elements in R(n, dn) as forming an orthonormal basis (Fact 0] 
(3)). This bilinear form makes sense w.r.t. the bimodule structure as well due to James’ submodule 
theorem (Fact El). Up to this point, we successfully parallel everything in the S(n,m) setting. 

What is missing is an analogue of the linear map / - this prevents us from completing this 
strategy. Nevertheless, what we’ve develop allows us to prove that for b to be a degree bound, then 
b must be fl(n 3 / 2 ) (Proposition 1111) . 

To make progress, we exploit in depth the diagonal action of Sd n on R(n,dn). Fortunately, 
the orbits under this action can be identified with row tabloid modules Fnl97 . Chap. 7.2], From 
this structure we obtain new forms of relations (Equation El)- We then prove that when b > dn, 
using these relations, each multilinear invariant indexed by a connected and simple (e.g. with no 
multiple edges) bipartite graph, can be written as a sum of those indexed by non-simple graphs 
(Proposition 112(1 . This provides a non-trivial reduction result in the spirit of the degree bound 
statement. 

We believe that the results we prove in this paper, will be useful to finally get a good degree 
bound for R{n,m), over fields of characteristic 0. 

3 The second fundamental theorem 

Reduction to the multilinear case. Over characteristic 0 fields, the well-known two proce¬ 
dures, polarization and restitution, reduce many questions for general invariants to multilinear 
invariants. 

For completeness, let us demonstrate this in the case of R(n,m) C ¥[x[ k j^]ke[m\,i,je[n\- Let 

Xk = (a -i k j)i,je[n]- Recall that each / € R(n, m) has degree divisible by n. Suppose £(X i,..., X rn ) = 
Yli a ifi forms a relation, where /,; € R(n,m). W.l.o.g. we can assume the /)’s are multi- 
homogeneous, that is for every fixed j € [m], the degrees of /j’s w.r.t. Xj are the same for every i. 
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Otherwise, we can divide i into the multi-homogeneous components, and each component will again 
be a relation. Suppose deg(A\.) in t is c4, then Y2,k dk = dn for some d E Z + . Then for each k E [m], 
we introduce d & copies of Xk as .XAu,..., X^ kdk ), as well as dk variables 2 /^, 1 5 ■ • • ,Uk,d k ■ Now con¬ 
sider the coefficient of n„ j Ui,j in ^(-X”(l, 1)2/1,1 + ' • •+^(l,di)2/l,di) • • • , ^(m,l)2/m,l + ' • '+X(m,dm)ym,dm ), 
and let it be £' - the polarization of i. It can be seen that £' is a multilinear invariant in dn matrices, 
and £' is a relation as well. Now we modify £' by substituting by X % . and let the result be £" 

- the substitution of £!. Over characteristic 0 fields, we see that i" = W^d^l. As demonstrated 
above, since every relation can be obtained by restituting some multilinear relation, it is enough to 
understand multilinear relations. 

Multilinear invariants of R(n , m). As invariants in R(n, m ) are of degree divisible by n, R(n , m) 
has a multilinear invariant if and only if n divides m. In the following we consider R(n, m) for 
m = dn where d E Z + . We use the descriptions of invariants in R(n,m ) given in |ANS07| . We 
reformulate their results here. 

We define a set of symbols P = \ii,... ,i n \, ij E [dn] with the anti-symmetric property: 

1^1? • • • ? • • • j ^6? • • • ? ^n| = |ii, . . . , Z5, . . . , , Z n |. 

Note that as F is of characteristic 0, the anti-symmetric property implies that if there exist ij = i^ 
for j ^ k then \i\, ..., i n \ =0. 

Then let V be F(P), the noncommutative polynomial ring with variables from P. A degree-d 
monomial in V is of the form 


1*1,1 ;■■■> *l,Tl| ° ° |*d,l , • • ■ , *d,n | ; 

where o denotes the noncommutative product. We often record it as an d x n tableau 


* 1,1 * 1,2 • • • h,n 

*2,1 *2,2 • • • *2 ,n , 

: ■ : • « 

id, 1 id ,2 * * * id,n 

A degree-d monomial |*u,. .., i\ yn \ o ■■■ o \id,i, ■ ■ ■, id,n\ is repetition-free, if {ij,k,j £ [d],k E 
[n]} = [ nd ]. Let V(d) be the F-vector space of spanned by degree-d repetition-free monomials in V. 

Likewise we define another set of symbols P = \i \,..., i n \, ij E [nd] satisfying also the anti¬ 
symmetric property, and define V. and V(d) as before. The vector space V(d) © T’(d) then has a 
basis S ®T, S £V(d) and T E V(d). 


Now consider the vector space (V <8> W)®* 1 where V = W = F n . This is spanned by vectors 
of the form (ui <8> wi) © ■■• © (vd n © Wdn ), E V, Wj E W. Given *i,*2,•••,*« £ [dn], we 

define a function || ||, on (ui © w±) © ■ ■ ■ © ( Vd n © Wd n ), as follows. || ii, ■ ■ ■ ,i n || sends 

(ui © w i) © ■ ■ ■ © ( Vd n © Wdn ) to the determinant of the n x n matrix [u^,..., Vi n ]. Likewise, define 
|| j i,... ,j n ||, jk € [dn] sending (ui © wi) © • • • © (Vdn © Wdn) to the determinant of the matrix 

[wj 1 ,... ,Wj n \. || || is known as a Pliicker coordinate. We then consider the product 

function 


* 1 , 1 ,• 


, *l,r 


*d, 1, • • • , *d,, 


Jl.l, 




Jd,l, 


, Jd,, 
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where {ik,e,k E [d],£ € [n]} = {jk,e,k E [d],£ E [n]} = [nd], and extend this function to all of 
(V <8 IT)®* 1 by linearity. This gives rise to a multilinear function on (V <8 IT)®* 1 . 

We then define a linear map cj) from V(d) <8 V{d) to multilinear functions on (V <8 IT)®* 1 , by 
sending 

(1*1,1) ■ ■ ■ i *l,7i| ° ■ ■ ■ ° |*d,l? ■ ■ ■ ? *d,7i|) *8 (|Jl,l, ■ ■ ■ , Jl,n| ° ■ ■ ■ ° |jd,l? ■ ■ • > Jd,n|) 

to || *1,1? • • • ? *l,n || ' ■ ■ ■ ' || *d,l: ■ ■ ■ ? *d,n || ' || Jl,l ? ■ ■ • i jl,n || ' • • • ' || jd, 1? ■ ■ ■ ? Jd,n || (2) 

and extend by linearity. It is understood that if we write 4> to apply to monomials in V(d) or V{d), 
we replace | • | to || • ||. It is not difficult to observe (see e.g. [ANS07] ) that the image of 4> are 
precisely the multilinear functions on (V <8 IT)®* 1 which are invariant under the natural action 
SL(n,F) x SL(n,F). 

The formalism above is necessary for the formulation of the second fundamental theorem, which 
amounts to describe the kernel of cj). 

Second fundamental theorem for multilinear invariants. In this part we describe the kernel 
of (f). It is clear that when d = 1, ker(</>) is 0. In the following we assume d> 2. 

Recall the Pliicker relations for Pliicker coordinates: for i±,... ... ,j n E [nd] and k E [n], 

we have 


|| *1 ? • • • ? *7i || ' || Jl) • • • ) jn || 

= 'y ] II *1? ■ ■ ■ ? jl ) • • • ) jkt ■ ■ ■ ? *n || ■ || *S 1 : • ■ ■ ? *Sfc : jk+ 1? ■ ■ ■ ijn || ? 

l<Si<S2<-”<Sfc^ n 

where j\.... ,jk are in positions s\,... ,Sk- For a monomial T in V(d) recorded as in Equation [U 
following Fulton jFul97. pp. 970, we use 7r i)fc (T) to denote the vector in V(d) of the form 
where T' runs over the tableaux obtained by switching the first k elements in the j + 1th row, with 
k elements in the jth row of T. Let JC(d) be the span of all T — where T E V(d), j E [n— 1], 

and k E [n\. In particular, note that when k = n, 7r^ n (T) is just switching the jth and j + 1th 
row. So though monomials in V{d) are non-commutative, modulo K-(d) they become commutative, 
which is consistent with the image of cj). 

A monomial T in V(d) is standard , if its tableau is (strictly) increasing in each row and each col¬ 
umn. Following a procedure called the straightening, it is well-known that the standard monomials 
form a basis for the quotient space V(d)/JC(d). 

Similarly we have IC(d) in V(d). By Pliicker relations, we know that (AT(d)<8P(d)UP(d)®/C(d)) 
lies in ker(0). (Recall that (•) denotes the linear span.) We show that these two sets are equal. 

Proposition 1. Let notations be as above. We have (JC(d) (8 V(d) U V(d) <8 K.(d)} = ker(0). 

Proof. It remains to prove that ker(</>) C (IC(d) <8 V{d) U V(d) <8 IC(d)). Let l be in ker((/>). Suppose 
£ is _ 

ciS 1 ! ® Ti + C 2 S 2 <8 T 2 -\ -+ CkSk <8 T k 

where cf s are in F, and Sfs (resp. Tf s) are monomials in V(d) (resp. T(d)). Note that Sf s (resp. 
Tf s) are not necessarily distinct. We first arrange according to Sf s, and write £ as 

S\ 8> h\ + S 2 <8 /*2 + • • • + Sk' <8 hk' 

3 With a slight change from column as in [Ful97l to row here. 
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where hi E 'P(d), and Si, ..., are distinct. Modulo the space /C(d) ©'P(d), we express each Si as 
a linear combination of those ones indexed by standard tableaux. Then by re-grouping according 
to the standard tableaux, £ is expressed as 


S[ © h[ + S' 2 © ti 2 + ■ ■ ■ + S' p © ti p 

where h! i E V(d), and S\, .... S' p are standard and distinct. Now we claim that every S[ © h[ must 
be in V(d) © K,{d). W.l.o.g. assume S[ © h' x is not, that is h\ is not in K{d). Then 4>{h \) is not a 
zero function on W® nd . So there exists some w = w± © W 2 © • • • © w n d s.t. (f)(h\)(w) ^ 0. Now we 
restrict cj)(£) to the set (V © w\) © ■ ■ ■ © (V © w n( i) which yields 

4>(S[) ■ w) + <j>(S' 2 ) ■ (t>{h' 2 ){ w) H-1- (j>(Sj) ■ 4>(h'j)( w). 

Since S'^s are standard and distinct, (j > (S , i y s are linearly independent as functions on V® dn . As 
4>{h , 1 )( w) 7 ^ 0, the restriction of £ on (V © w{) © • • • © (V © w n d), as a function on V® d , is nonzero. 
This contradicts £ being a zero function on (V <S> W)® dn . So every S[ <S> h' t is in V <S> JC. From this 
we conclude that £ € (IC(d) © V(d) U V(d) © JC(d)}. □ 

4 The FSdn bimodule structure 

There exists a natural action of Sd n x S d n on V{d) © V{d). Suppose S and T are monomials in V 
and V respectively. Then (a, r) € S dn X S dn sends S © T to S a © T r , where cr and r permute the 
entries of S and T. This endows V(d) ®V(d) an ¥S n d bimodule structure. This action descends to 
the relations describing the second fundamental theorem: if £ is a relation (in ker(</>)), then £( <T ’ T ) 
is also a relation. So ker(0) is an ¥Sd n sub-bimodule, whose structure will be described in the 
following Fact [4j 



module as shown in ( Ful97 . Chap. 7.2], We collect some basic facts about the column tabloid 
module adapted to our setting. Recall that for a nonnegative integer s, a partition of size s is 
a non-increasing sequence of nonnegative integers A = (Ai,...,A^), with = s - This is 

denoted as A h s. We identify (Ai,..., A^) with (Ai,... , A^, 0,..., 0); namely any trailing zeros 
are assumed implicitly if required. The conjugate of A denoted by A is a partition of s with 
A i = \{j E [£} s.t. A j > z}|. The height of A is max(z E [£] \ A i / 0), and the width is Ai. 
Partitions are usually represented using Young diagrams: that is a concatenation of rows of boxes 
arranged to be left aligned, with the ith row having \ boxes. For two partitions v = {y\,, v^) 
and A = (Ai,..., A^), u dominates A if for any j E Z + , J2i =i u i ^ J2i =i We use d n to denote 
the partition (d,...,d) of height n. For a partition A of [dn], Sp(A) denotes the Specht module 
corresponding to A over F. 

Fact 2 ( |Ful97l Chap. 7.4]). 1. As an FS^n module, V(d) decomposes as 

Sp (^)ffi(® Sp(A)® c >), 

A \-dn 

4 Note that the notation in lFuI97| is a bit different from ours; there the column tableux satisfy anti-symmetric 
property along the columns. So one needs to flip the tableaux here to match the results there. 
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where A runs over partitions of dn strictly dominated by d n , and c\ = K-? d , the Kostka 
number w.r.t. A ,n d . 

2. Given a monomial S G V(d) represented by a tableau, the column subgroup of S, cl(S') is 
the subgroup of Sdn that preserves the columns of S. The (unsigned) column symmetrizer 
c(S) = Sjred(S) 7r - Then £{d) := (c (S) ■ S \ S G V{d) are monomials ) = Sp(d n ). 

3. )C(d) is a submodule ofV(d) and is isomorphic to © Ahdn Sp(A)® Cx . 

We also observe the following adaptation of James’ submodule theorem to column tabloid 
modules. 


Fact 3 (See e.g. |Sag01[ Theorem 2.4.4]). Let (3' be the bilinear form on V(d) by setting the mono¬ 
mials as an orthonormal basis. Then JC(d) and£(d) are orthogonal complement of each other under 
/?'. 


Thus, as an FS^ bimodule, V(d) © V(d) is the tensor product of two column tabloid modules. 
Its structure is then easily deduced from Fact [2] and [3j 

Fact 4. 1. As an FSrf n bimodule, V(d) ©'P(d) decomposes as 


(Sp(cT) © Sp (d n )) © ( 0 (Sp(A) © SpH)®^), 

\v\-dn 


where A and v are dominated by d n , and at least one of X and v is strictly dominated by d n . 
c\ and c v are Kostka numbers as in Fact 0 

2. ker(</>) = (JC(d) © V{d) U V{d) © IC(d)) is an FSrf n sub-bimodule, isomorphic to the summand 
©A^i-dn( S P( A ) ® Sp(z/))® c >' c % as above. 

3. Let (3 be the bilinear form on V(d) © V(d) by setting S©T, where S and T are monomials in 
V(d) andV(d), respectively. Then £(d)®£(d) and (V(d)®lC(d)GlC(d)®V (d)) are orthogonal 
complement of each other under (3. 


5 The diagonal action of S^ n 

In this subsection we consider the diagonal action of S dn on V(d) ®V{d). That is, a G S dn acts on 
V(d) © V(d) as (a, a) G S dn x S dn - 

To understand this structure we introduce a combinatorial structure called the correlated 
tableaux. A correlated tableau C of size dxd is & dxd matrix whose entries are subsets of 
[dn], satisfying: (1) C(i,j), i,j G [d] form a partition of [ dn] ; (2) Vi G [d], | C(i,j) | = n; (3) 

Vj G [d], | U ieM C(i,j)\ = n. 

Correlated tableaux are in 1-to-l correspondence with S © T where S and T are monomials 
from V(d) and V(d), respectively. Given S © T, we can form a correlated tableau C by setting 
C(i,j ) = {k G [nd] \ k G Si and k G Tj}. Given a correlated tableau C, by reading along the rows 
from top to bottom we get a monomial S, and by reading along the columns from left to right (and 
adding ®) we get a monomial T. 





Let us set up the following convention about arranging the rows of S (resp. T) as in S <g> T, 
where S and T are understood as tableaux as shown in Equation [T] In the literature, it is more 
common to arrange each row of S to be increasing, namely ij i < ij t 2 < • • • < ij, n for every j. But 
when S appears in S®T we shall order each row while taking into consideration of T as well. That 
is, first form the correlated tableau C based on S <S)T, and then read each row of C from left to 
right. If there are more than 1 elements in an entry, then use the increasing order. The rows of T 
are also ordered in a similar way. 

As an example, consider 


s = 

1 2 4 

3 5 6 

,T = 

1 2 7 

3 4 5 


7 8 9 


6 8 9 


The correlated tableaux then is C 


{1,2} 

{4} 

0 

0 

{3,5} 

{6} 

m 

0 

{8,9} 


So we obtain by reading C 


1 2 4 


1 2 7 

3 5 6 

,T' = 

4 3 5 

7 8 9 


6 8 9 


That is T' = —T. 

Having defined correlated tableaux, we define the (d,n)-correlated diagram as a d x d matrix 
D over N with row and column sums being n. For a correlated diagram D, we form its diagonal 
partition 5(D), by arranging the entries of D to be non-increasing to get a partition of dn. Each 
correlated tableau C gives rise to a correlated diagram Dc by setting Dc(i,j) = \C(i, j)|; we all call 


Dc the shape of C. Using the example as above, Dc = 


2 

1 

0 

0 

2 

1 

1 

0 

2 


5(Dc) then is (2,2, 2,1,1,1). 


Fix a correlated diagram D, and let C(D ) be the subspace of V(d), spanned by all correlated 
tableaux of shape D. C(D) is clearly a submodule of V(d) under the diagonal action of S dn- Let 
us define the row tabloid module of FS s , s € Z + . For a partition Ahs, row tabloids of shape A are 
fillings of the Young diagram of shape A by [s] without repetitions, with the following equivalence 
relation: two row tabloids P and Q are equivalent if the corresponding rows have the same entries. 
Row tabloids admit an FS s action by permuting the entries. The linear span of all row tabloids 
of shape A is then an FS s module and denoted as 77(A). The decomposition of the row tabloid 
module is well-known and quite similar to that of the column tabloid module as shown in Fact [2j 
see jFu!97l Chap. 7.2], 


Lemma 5. As an FSrf n module, C(D ) is isomorphic to the row tabloid module 1Z(5(D)). 


Proof. To set up a linear map between C(D) and 1Z(5(D)), arrange the entries of C € C(D) 
following an arbitrary but fixed order as long as this order maintains the sizes of the entries to be 
non-increasing. This gives a row tabloid in 1Z(5(D)), and then extend by linearity. To see that 
the actions of S dn are compatible, note that by our convention of sending correlated tableaux to 
monomials, the diagonal action of Sd n is just permuting the entries in the correlated tableau. In 
particular, if i and j are in the same entry of C, then switching i and j leaves C unchanged, as two 
— 1 are produced from the two monomials associated with C. □ 
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Proposition 6. As an FS*i module, V(d)®V{d) is isomorphic to ©£>77(<5(IA)), where D runs over 
all (d,n)-correlated diagrams. 

We also note that the well-known Robinson-Schensted-Knuth correspondence gives a 1-to-l 
correspondence between (d, n)-correlated diagrams and pairs of semistandard tableaux with the 
same shape A, and the same content as n d . Note that A necessarily dominates n d to satisfy the 
semistandard condition. On the other hand, such pairs of semistandard tableaux could be used to 
index those FS^ n sub-bimodules of V(d) <8> V(d) of isomorphic type Sp(A) <8> Sp(A). This follows by 
adapting the results in | ISag01[ Chap. 2.10] to the column tabloid setting. 

Recall that </> realizes V(d) <g> V(d) as functions on (V <8> W)® dn , as dehned in Equation [2j We 
now describe another set of vectors in ker(</>) based on the FS dn structure. Given a correlated 
tableau C, suppose C has > n 2 + 1 nonempty entries. Note that this requires d > n + 1. Fix 
N := re 2 + 1 entries and from each entry choose a number, denoted as I = {zi,... ,i n 2 +1 }. Then 
consider the following vector in V(d) ®V(d)\ 

Alt (C,I)= ^sgn(a)C CT , (3) 

crgS/ 

where a acts on C as in the diagonal action of S^ n . Then </>(Alt(C,/)) is a zero function on 
(V <8> W)® dn , since it is alternating in n 2 + 1 copies of V <8> W, an n 2 -dimensional vector space. Note 
that construction is very natural in light of the identification of C(D) with 1Z(5(D)). 

6 Towards proving degree bounds 

Reduction to the multilinear case. Over characteristic 0 fields, we can also reduce the degree 
bound problem to the multilinear case as well. Consider a graded invariant ring R, for which we 
want to prove that R is generated as a ring by R<b = {f £ R | deg(/) < b}. If R is generated by 
R<b, then any homogeneous multilinear invariant / of degree b' > b is equal to fcj where 

b > deg (fi j) > 1. On the other hand, suppose each multilinear invariant of degree b' > b can 
be written as such. Take any homogeneous g € R of degree b' > b, fully polarize it to get a 
multilinear /, which then by assumption can be written as X/* 11/ /*j where b > deg (fij) > 1. 
Then restitute back to get b'\g on one hand, and on the other hand YliWj 9i,ji where gij are 
obtained via restitution to fij. So b > deg {gij) = deg (fij) > 1, which implies that R is generated 
by R<b- 


Some reducible polynomials in V{d) <8) V(d). Recall that V{d) (8) V(d) is spanned by S <g) T, 
where S and T are monomials; or equivalently, by correlated tableaux C. We now view C as a 
bipartite graph on ( L U R, E) allowing multiple edges, where L = R = [d], and there are k edges 
between i and j if C(i,j) = k. 

We shall prove that <p(C) is reducible if and only if this graph is disconnected. To show this, 
we recall the matrix-theoretic interpretation of 4>(C) as in [ANS07] . Let A ±,..., A s be s variable 
matrices of size sxs, and yi,... ,y s are newly introduced variables. The mixed discriminant rndisc 
is a polynomial in s 3 variables in Aj’s, defined as follows. Given a £ S s , let A a be the matrix whose 
ith column is the ith column of A a ^y Then mdisc(Ai,..., A s ) = X^o-es s det(A ff ). 

Given a correlated tableau C , we form dn matrices of size dn x dn Yi,, Yd n as follows. For 
every i, Yi is viewed a d x d block matrix, where each block is of size n x n. If i appears in the 
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(j. A:)tli position of C, then Yj has X t at the (j, k ) block, and 0 everywhere else. In particular note 
that by construction mdisc(Yi,... ,Yd n ) is a multilinear polynomial in Xj’s. 

Theorem 7 ( [ ANS07j l. 0(C) is equal to mdisc(Yi,..., 10 n ). 

Proposition 8. 0(C) is a reducible polynomial if and only if C is a disconnected graph. 

Proof. If C is disconnected, then 0(C) is a product of at least two multilinear polynomials of 
smaller degree, each of which corresponds to a connected component. On the other hand, if C is 
connected, let us assume that 0(C) can be written as / • g. Without loss of generality assume / 
is not constant, and let us show that g is constant. As / is not constant, some variable Xjfa,b ) 
appears in /. Suppose a variable X p (c,d ) appearing in 0(C) does not appear with Xj(a,6) in any 
monomial. Since 0(C) is multilinear and Xj(c, d) appears in 0(C), the only way this can happen is 
if Xj(c,d ) is also in /. We say Xj(c,d) connects to Xj(a, 6) if this happens. Note that if X^(u,v) 
connects to Xj(c,d ) and Xj{c,d) connects to Xj(a,6), then A0(tt,u) is in / as well. It is not hard 
to see from mixed discriminant perspective that: (1) every variable in X,- connects to Xj(a, 6); (2) 
if Xj is in the same row or column as Xi in C, then every variable in Xj connects to some variable 
in Xj. Since C is a connected graph, using (1) and (2) iteratively we put every variable in /, which 
implies that g is a constant. □ 

Let D(d) be the span of disconnected correlated tableaux in V{d) ®V(d), and &(d) the span 
of connected correlated tableaux. 

Using the second fundamental theorem. A degree bound of, say, degree bn, would involve 
showing that every invariant of degree dn, d > b, is equal to a sum of products of invariants of 
smaller degree, modulo relations in 0. Algorithmically we would start with an invariant of degree 
d, and express it as a sum of products of invariants of smaller degree plus something in the kernel 
0. If in this new expression there still remain invariants of degree bigger than b, we write those as 
sums of products of invariants of smaller degree plus elements in the kernel 0. We continue, till all 
invariants involved in the final expression have degree less than or equal to bn. 

We are then led to examine the following situation - if B is a set of polynomials spanning 
R(n, dn), is every irreducible polynomial in B equal to a sum of reducible ones in B plus something 
in the kernel of 0? If this happens beyond a certain degree bn, we get an upper bound of bn, on the 
degree in which the invariant ring is generated. Taking B as 0(C), where C runs over all correlated 
tableaus, we summarise the above argument in the following 

Proposition 9. R{n,m) is generated by R(n,m)<bn if and only if for every d > b, D(d) U ker(0) 
spans V(d) <g> V(d). 

Using the bilinear form 0 introduced in Fact [4] (3), we have the following. 

Corollary 10. R{n,m) is generated by R(n,m)<bn if and only if for every d > b, £(d) n (£(d) <S> 
£(d)) = 0. 

We now use some results developed so far to lower bound the degree in which the invariant ring 
can be generated as an algebra. 

Proposition 11. If R(n,m) is generated in degree bn, then b = I2(n 1//2 ). 
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Proof. We use proposition [9j First observe that for £>(<i)Uker(0) to span V(d)®V{d) it is necessary 
that dirn(2)(d)) + dim(ker(0)) > dim (V(d) <g) V(d)). We show that when d < o{n 1 / 2 ) this does not 
happen, by doing a dimension count. To compute the dimensions proceed as follows. 

1. dim (V(d) <g> V{d)) = D 1 := (gp) 2 ; 

2. By the hook length formula, dim(£(d) <g> £{d)) = Ih := ( (i.,,,. n ).(2.....(n+i))-'..-(d-...-(d+n-i)) ) 2 i 

3. dim(ker(0)) = D\ — D 2 ; 

4. A lower bound on dim(2)(<i)) is (This counts only those disconnected graphs with 
C(l, 1 ) = { 1 , ■ ■ ■ ,n}. 

5. An upper bound on dim(2)(d)) is 



((d - fc)n)h 2 
( n\) d ~ k 1 


where fixes k rows and columns as a connected component C, chooses kn numbers 
as the labels within C, and the rest terms count the number of ways filling in C and the other 
component. 

Using (3) and (5), we prove that when d = o{y/n) and n is large enough, then dim(2)(cZ)) < 
dim (V(d) <S> V(d)) — dinr(ker(0)), which implies that the degree bound is necessarily hl(n 3//2 ). For 
this we need to verify that 


ST' (d\ 2 (dn\ , (kn )\ ,2 A(d - k)n)\ s 2 , _ (nd)\ _ ^2 

“ \k) \kn) (n!) fc ( n\) d ~ k < (1 • ... • n) • (2 • ... ■ (n + 1)) • ... • (d ■ ... ■ (d + ra — 1)) 


for d = o(y/n). It is easy to see that this is equivalent to showing that 

AA 2 (kn)\((d- k)n)\ . _ (rdf _ ,2 

“ \k) ( dn)\ Tl • ... • n) • (2 • ... • (n + 1 )) • ... • (d ■ ... ■ (d + n — 1 )) 

Upper bounding the left hand side by 2 2d ■ — and rearranging the terms we need to show 

that 

(dn)\ , d _ (1 • ... • n) • (2 • ... • (n + 1 )) • ... • (d • ... • (d + n - 1 )) v 2 

n\((d — l)n)! ' ( n\) d 

The left hand side is at least ^ dn ~ n+ ^y---d dn ) anc j ^ ie right hand side is at most 

[2 d • (n + l ) d_1 • (n + 2) d ~ 2 ■ ... ■ (n + d - l)] 2 , 


which is in turn at most 

[2 d - (n + d- l)©] 2 . 


It is easy to verify now that when d = o( v / n), (dn — n + 1) ■... • (dn) is asymptotically larger than 
n n • [ 2 d • (n + d — l)®] 2 . □ 


12 
















On the other hand, using (4) and (5) it is easy to deduce that when d = uj(s/n) then dim(2)(d)) > 
Z?2 asymptotically in n, and when d > An then dirn(2)(d)) > D -2 unconditionally. This means that 
once d is Ll(n), then dim(S)(d)) + dim(ker(</>)) > dim('P(d) <g )V(d)), clearing a bottleneck to prove 
a polynomial degree bound for R(n,m). 

On connected graphs without multiple edges. Recall that by Proposition [9] we need to 
consider whether D(d) U ker(0) spans V(d) <S> V(d) for large enough d. In this section we prove an 
analogous, but weaker result. 

Let 6(d) be the span of those correlated tableaux that are connected, and simple; that is, with 
no multiple edges. 6(d) is clearly a subspace of <£(d). Let Wl(d) be the span of the tableaux that 
are not in 6(d). Namely those tableaux spanning 9Jt(d) are either disconnected, or connected and 
with at least one multiple edge. In particular they include any tableau with at least one multiple 
edge. 

In the proof of the following proposition, besides the relations JC(d) <8> P(d) and V(d) <g> /C(d), 
we crucially use those ones as described in Equation [3j We then refer to relations in IC(d) <8> V(d) 
and V(d) <S> K(d) as the first type, and relations from Equation [3] as second type realtions. 

Proposition 12. Let d > n + 1. Then V(d) <g> V(d) is spanned by 911(d) U ker(</>). 

Note that if we define il(d) the span of correlated tableaux whose graphs have at least one 
multiple edge, then this proposition is just to say that when d > n, 39(d) U ker(^) U il(d) spans 
V(d)®V(d). 

Proof. Take any correlated tableau C E 6(d) ; namely the bipartite graph associated to C is 
connected and without multiple edges. We shall show that C can be written as a linear combination 
of tableaux with multiple edges via the help of ker (<f>). 

To do that let G be the bipartite graph of C, and T be the set of edges of G. (T is of course 
labelled by [nd], but we make T explicitly for clarity.) Note that |Tj = dn > n 2 + 1. For some 
a E St, suppose o = T 1 T 2 ... where r* is a transposition switching e and e' in T. As G is 
connected, for each pair of edges e, e' € T, e is connected to e' by some path in G. Thus r* can be 
further decomposed as a product of transpositions consisting of edges along that path connecting 
e and e'. That is for each i we have r t = 7i"j i7Tj 2 • • • where 7switches two edges sharing a 
common vertex. 

Consider now 7Ti ; i, switching adjacent edges e and e', and suppose the position of e is (s,ii) 
and e! is (s,t 2 ), s,t±,t 2 € [d] and t\ < t 2 ■ That is, we assume e and e' are in the same row as in 
the correlated tableau. By the relations of the first type, we move e to the ^th column along the 
sth row and get C = —C 771 ’ 1 + ]TL Dj, where Di E D3l(n, d). We explain the terms on the RHS: the 
— 1 sign before C 771 ’ 1 is because when we switch e and e', the order of reading the rth row changed: 
from first reading e and then e' to first e' and then e. Note that C 771:1 is of the same shape as C. 
Df s are in Wl(n,d) because as long as e and e' are not switched, we would have e and e! both at 
position (s, t 2 ). (There might be some —1 before Df s too, but this can be neglected.) The example 
given after this proof illustrates this calculation. 

We then apply other 7 t m -’s to C™ 1 ’ 1 sequentially; each application of 7rjj would yield a bunch 
of Df s in Wl(n,d). At last we shall get C = sgn(<r)C' CT + )>T D, where Di E Wl(n,d). For every 
a E St such an equation can be derived (when a = id use C = C), and we have |Tj!C = 
So-eS T sgn(cr)C' CT + Plugging the second type of relations in, and noting that \T\ > n 2 , 

C = ppjj (£N Df) where Di E 9Jt completing the proof. □ 
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Example 13. In this example 


s = 

1 2 

3 4 

,T = 

1 3 

2 5 


5 6 


4 6 


The corresponding graph is 



The kernel has the following relation coming from the right monomial. 


1 3 


2 3 


1 2 

2 5 

= 

T 5 

+ 

3 5 

4 6 


4 6 


4 6 


Multiplying this by S and recalling our convention on associating monomials to correlated tableaus, 
we have 


i 





To illustrate the utility of Proposition 1121 developed so far, we consider matrix semi-invariants 
for 2x2 matrices. For 2x2 matrices, note that as soon as there exists a multiple edge, a 2-regular 
bipartite graph is disconnected. Therefore, an immediate application of Proposition [12] gives the 
following known result. 

Theorem 14. The matrix semi-invariants of 2x2 matrices are generated by those of degree < 4. 

For 2x2 matrices, Domokos presented an explicit generating set, and from this description he 
deduced that (3 = 4 [DomOPa] . Therefore our bound is tight in this case. Also note that our result 
is obtained without computing a single invariant. 
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Partha Mukhopadhyay and K. N. Raghavan for discussions related to this work. Part of the 
work was done when Youming was visiting the Simons Institute for the program Algorithms and 
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